REMARKS 

Claims 1-4, 6-18, 20, and 22-35 are now pending in the application. Claims 1-35 
stand rejected. The Examiner is respectfully requested to reconsider and withdraw the 
rejection(s) in view of the amendments and remarks contained herein. 
Rejection Under 35 U.S.C. §103 

The Examiner objects to claim 20 because it depends from cancelled claim 19. 
This objection is respectfully traversed. 

Applicants have amended claim 20 herein to render it dependent from claim 18. 

Accordingly, Applicants respectfully request the Examiner reconsider and 
withdraw the objection to claim 18. 
Rejection Under 35 U.S.C. § 103 

Claims 1-4, 6-18, 20, and 22-35 stand rejected under 35 U.S.C. § 103(a) as 
being unpatentable over Anderson (U.S. Pat. No. 6,499,016), in view of Li et al. (U.S. 
Pat. No. 6,397,181) and McLean (US 2002/0099456). This rejection is respectfully 
traversed. 

Anderson is generally directed toward automatically storing and presenting digital 
images using a speech-based command language. In particular, the Examiner relies on 
Anderson to teach textually tagging captured images based on user speech. However, 
Anderson does not teach, suggest, or motivate allowing a user to select one of a 
plurality of speech recognition lexica by media capture activity. 

Li et al. is generally directed toward Voice annotation and retrieval of multimedia 
data. In particular, the Examiner relies on Li et al. to teach employing a user-selected, 
topically-focused speech recognition lexicon to convert user voice tags to text. 
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However, Li et al. do not teach, suggest, or motivate allowing a user to select one of a 
plurality of speech recognition lexica by media capture activity. 

McLean is generally directed toward user interfaces. In particular, the Examiner 
relies on McLean to teach a user interface adapted to permit a user to navigate between 
and select one of the lexica by media capture activity. However, McLean only teaches 
allowing a user to select a speech recognition filter by selecting a language, which is a 
characteristic of the user supplying the voice tag, as opposed to a characteristic of the 
captured media. Therefore, the user of the McLean modified system would be 
permitted only to select a language filter by tagging activity, and not by media capture 
activity. 

Applicants' claimed invention is generally directed toward textual tagging of 
captured media by recognition of voice annotations associated with the media. In 
particular, Applicants' claimed invention is directed toward using focused speech lexica 
that are focused toward specific media capture activities, wherein a user is permitted to 
select one of the lexica by media capture activity. For example, independent claim 1 , 
especially as previously presented, recites, "a user interface adapted to permit a user to 
navigate between and select one of the lexica by media capture activity." Independent 
claims 11 and 18, especially as previously presented, recite similar subject matter. The 
distinction between media capture activity and tagging activity is made in the originally 
filed specification at paragraph [0021]. In particular, it is clear that the media capture 
activity is not the capture of the voice tag. Rather, the media capture activity relates to 
the capture of media other than the user speech. This distinction is clear in the claim 
language by virtue of limitations reciting, "an audio input receptive of user speech 
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relating to a media capture activity in close temporal relation to the media capture 
activity ... a media tagger adapted to tag captured media with text generated by said 
speech recognizer based on close temporal relation between receipt of recognized user 
speech and capture of the captured media." In other words, receipt of the user speech 
and capture of the captured media are defined as two distinct events having a close 
temporal relation. Therefore, selection of a language filter for recognizing the voice tag 
according to the teachings of McLean at paragraph 127 merely allows selection of a 
speech lexicon by tagging activity, as opposed to media capture activity. Therefore, 
Anderson, Li et al., and McLean fail to teach, suggest, or motivate all of the limitations of 
the independent claims. These differences are significant. 

[0001] The differences between Applicants' claimed invention and the 
combined teachings of Anderson, Li et al., and McLean are significant because the user 
of Applicants' claimed invention can select a lexicon by characteristics of captured 
media instead of characteristics of a user tagging the media. For example, the lexica 
can be focused towards characteristics of captured media as discussed at paragraph 
[0027] of the originally filed Specification, such as "'portrait', 'landscape', and 'other* for 
still images ... 'sports', 'drama', 'comedy', and 'other' for multimedia streams ... 
subcategories ... in the categories, such as 'mountains', 'beaches', and 'cityscapes' for 
still images under the 'landscape' category." Therefore, the user can select the lexica 
by these types of categories and/or subcategories relating to characteristics of the 
captured media, instead of merely characteristics of the user tagging the captured 
media. In contrast, Anderson, Li et al., and McLean fail to teach, suggest, or motivate 
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allowing the user to select a language filter for recognizing the voice annotations based 
on anything more than language spoken by the user. 

Accordingly, Applicants respectfully request the Examiner reconsider and 
withdraw the rejection of independent claims 1,11, and 18 under 35 U.S.C. § 103(a), 
along with rejection on these grounds of all claims dependent therefrom. 
Conclusion 

It is believed that all of the stated grounds of rejection have been properly 
traversed, accommodated, or rendered moot. Applicant therefore respectfully requests 
that the Examiner reconsider and withdraw all presently outstanding rejections. It is 
believed that a full and complete response has been made to the outstanding Office 
Action, and as such, the present application is in condition for allowance. Thus, prompt 
and favorable consideration of this amendment is respectfully requested. If the 
Examiner believes that personal communication will expedite prosecution of this 
application, the Examiner is invited to telephone the undersigned at (248) 641-1600. 



Harness, Dickey & Pierce, P.L.C. 
P.O. Box 828 

Bloomfield Hills, Michigan 48303 
(248)641-1600 

GAS/JSB/kup 



Respectfully submitted, 
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